Optimizing Text Categorization for Indonesian Text Using Clustering Label Technique

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Text Categorization Using Trend-Tracking Technique

In this paper, we propose a novel text categorization method using trendtracking technique. The method classies texts by tracking the transition of information in them. Therefore, it can deal especially well with texts whose content transits gradually with the passage of time, such as Internet news articles, newspaper articles, or web pages which are often updated. Experimental results show tha...

متن کامل

Improving Methods for Single-label Text Categorization

As the volume of information in digital form increases, the use of Text Categorization techniques aimed at finding relevant information becomes more necessary. To improve the quality of the classification, I propose the combination of different classification methods. The results show that k-NN-LSI, the combination of k-NNwith LSI, presents an average Accuracy on the five datasets that is highe...

متن کامل

Selection Strategies for Multi-label Text Categorization

In multi-label text categorization, determining the final set of classes that will label a given document is not trivial. It implies first to determine whether a class is suitable of being attached to the text and, secondly, the number of them that we have to consider. Different strategies for determining the size of the final set of assigned labels are studied here. We analyze several classifi...

متن کامل

Two-dimensional Clustering for Text Categorization

We propose a new method to improve the accuracy of Text Categorization using twodimensional clustering. In a number of previous probabilistic approaches, texts in the same category are implicitly assumed to be generated from an identical distribution. We empirically show that this assumption is not accurate, and propose a new framework based on twodimensional clustering to alleviate this proble...

متن کامل

Automatic Word Clustering for Text Categorization Using Global Information

This paper presents a cluster-based text categorization system which uses class distributional clustering of words. We propose a new clustering model which considers the global information over all the clusters. The model can group words into clusters based on the distribution of class labels associated with each word. Using these learned clusters as features, we develop a cluster-based classif...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Turkish Journal of Computer and Mathematics Education (TURCOMAT)

سال: 2021

ISSN: 1309-4653

DOI: 10.17762/turcomat.v12i3.947